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DETAILED ACTION 

1 . Claims 1-26 have been examined. 

Papers Received 

2. Receipt is acknowledged of the request for continued examination and 
amendment papers submitted, where the papers have been placed of record in the file. 

3. The 35 USC 1 1 2 rejections to the claims have been overcome by the 
amendment and are herein withdrawn. 

Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed 
or described as set forth in section 102 of this title, if the differences between the 
subject matter sought to be patented and the prior art are such that the subject 
matter as a whole would have been obvious at the time the invention was made 
to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was 
made. 

5. Claims 1-18 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Kahle (5,467,473) in view of Yoaz (5,987,595) and in view of Lipasti (On the Value 
Locality of Store Instructions). 

6. In regard to claim 1, 

a. Kahle discloses an apparatus comprising: 

i. A processing section (figure 1 ); • 

ii. an extended load buffer; [Figure 5 gives a layout of a load queue 
which is also a buffer.] 
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iii. a marking processing section; [Figure 6, shows in step 4 that a load 
program number is placed or marked into the load queue (extended load 
buffer). Therefore a marking unit must exist to perform this marking.] 

iv. a comparing processing section; [Figure 6, step 6 and column 2, 
lines 56-59 show that a store address is compared to the load addresses 
of the load buffer. For this comparison to take place a comparison unit 
must exist.] 

v. and a recovery processing section; [Figure 6, steps 9 and 10 and 
column 3, lines 4-1 1 show that a load instruction must be placed in original 
order and reexecuted. It is shown that this is because of a conflict that 
exists and therefore original order must be recovered. This must be 
accomplished with a recovery unit.] 

vi. wherein unexecuted load instructions are advanced over silent 
store instructions. [Column 2, lines 56-59 show that when a store 
executes, it's address is compared to previously executed load 
instructions, in a load queue, which executed out of order ahead of the 
store. So in the timeframe of the store execution the loads have been 
previously executed. However, in order to execute the load instructions 
out of order ahead of the store, the unexecuted load instructions are 
inherently advanced over the store for execution and once executed 
become previously executed loads that were previously advanced over 
the store. This is further show in column 3, lines 57-66. An example is in 
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Figure 2 with column 5, line 46 - column 6, line 13 where a load is 
executed ahead of a store instruction and thus the load is advanced 
ahead of a store instruction for execution (and thus was unexecuted when 
advanced). When the data being stored by the store instruction is the 
same as the data already in the register, the store instruction is a silent 
store and was likewise advanced over by the load. Since load instructions 
can be advanced past any store, they will also be advanced past silent 
stores, which do not affect system state as defined in the specification.] 

b. Kahle does not disclose a predictor having a collision history table (CHT), 
said predictor for predicting silent store instructions, and that the processing 
section is coupled to the predictor. Kahle also does not disclose wherein the 
predictor compares an unexecuted load instruction with an issued and 
unexecuted store instruction value, and the unexecuted load instruction 
bypasses the issued store instruction for execution if the unexecuted load 
instruction value and the issued and unexecuted store value are the same. 

c. Yoaz has disclosed a predictor having a collision history table (CHT) 
(figure 3, element 88). Column 3, lines 50-52 show that the CHT is used for 
predicting and thus is part of a predictor along with the control unit (figure 3, 
element 102). Figure 3 shows that the CHT or predictor is coupled to a recorder 
buffer, 94, using some control logic. Column 6, lines 35-36, shows that this 
buffer holds entries for load instructions. Yoaz has disclosed in column 3, lines 
50-60 that the predictor is used for predicting load instructions so that loads can 
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be executed ahead of stores. Lipasti discusses the notion of a silent store on 
page 183, column 2, last paragraph. It has the same definition as given by the 
applicant. Lipasti mentions on page 183, column 2, third paragraph that stride 
prediction is used. One will notice that in this same paragraph Lipasti speaks of 
silent stores and how a tagged last value predictor's limit is reached by these 
silent store and that the stride predictor is used by their design because it is not 
limited by silent stores and thus can more accurately predict stores including 
silent stores. Further proof of the use of silent store prediction is shown in 
section 3.4. It is shown that a "perfect method" is used to squash silent stores 
once they are predicted. The Examiner is taking Official notice that it is well 
known to one of ordinary skill in the art to compare an unexecuted load 
instruction with an unexecuted store instruction value, and have the unexecuted 
load instruction bypass the issued store instruction for execution if the 
unexecuted load instruction value and the issued and unexecuted store value are 
the same. 

d. Yoaz has shown in column 2, lines 58-63 that his method is able to 
execute more load instructions out of order (based on the predictor) for faster 
processor operations. These faster processor operations would have motivated 
one of ordinary skill in the art to modify the design of Kahle to use the collision 
history table and predictor described by Yoaz. Page 185, section 3.1 of Lipasti 
then shows that squashing these silent stores (using prediction as shown in 
section 3.4 under the perfect method) allows a designer to obtain greater 
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performance from existing structures, or a reduction in size or complexity of the 
system. This ability to obtain greater performance or reduction in size would 
have motivated one of ordinary skill in the art to modify the design of Kahle to 
include the silent store prediction given by Lipasti. With these modifications in 
place the design now predicts what loads can be advanced ahead of stores, 
effectively predicting silent stores as shown above. One of ordinary skill in the 
art would have recognized the advantages of comparing an unexecuted load 
instruction with an unexecuted store instruction value, and have the unexecuted 
load instruction bypass the issued store instruction for execution if the 
unexecuted load instruction value and the issued and unexecuted store value are 
the same so that a memory access is avoided and performance is sped up by 
advancing a load over a store (and squashing the store) while sending data 
directly from the register in the store instruction to the register destination of the 
load instruction. 

It would have been obvious to one of ordinary skill in the art at the time of invention to 
modify the design of Kahle to include a predictor having a collision history table as 
disclosed by Yoaz that predicts silent stores as taught by Lipasti so that processor 
operations may be sped up and greater performance or smaller area may be realized. 
It would have been obvious to one of ordinary skill in the art to compare an unexecuted 
load instruction with an unexecuted store instruction value, and have the unexecuted 
load instruction bypass the issued store instruction for execution if the unexecuted load 
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instruction value and the issued and unexecuted store value are the same so that a 
memory access is avoided and performance is sped up. 

7. In regard to claim 2, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 1 , wherein the predictor is a silent store predictor, as 
described above. 

8. In regard to claim 3, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 2, as described above, wherein the silent store 
predictor uses path based indexing and the path is based on branches. [Column 5, 
lines 9-23 of Yoaz shows how the CHT (the predictor) is used. This section shows that 
the sequence of instructions is based on the correct prediction of branches. As shown 
in column 4, lines 8-10, the tag of the CHT is the linear instruction pointer. Thus, the 
predictor is indexed based on the linear sequence of instructions that is used based on 
branches.] 

9. In regard to claim 4, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 3, wherein the silent store predictor is coupled with a 
state machine. [Column 4, lines 43-45 of Yoaz show that the CHT includes prediction 
bits being either sticky or saturating counters. A saturating counter in itself is a state 
machine because it varies its state or value based on inputs.] 

1 0. In regard to claim 5, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 4, wherein the state machine is one of a 1-bit, 2-bit, 
and a sticky bit. [Column 4, lines 43-45 of Yoaz show that the CHT includes prediction 
bits being either sticky or saturating counters.] 
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11. In regard to claim 6, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 1 , as described above, wherein the predictor is 
memory dependent. [Column 3, lines 54-60 of Yoaz show that predictor is based on 
memory addresses and thus is memory dependant] 

1 2. In regard to claim 7, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 1 , wherein the extended load buffer comprises bit 
fields to mark load address match, load data match, load predict, and load flush, and bit 
fields for load address, load attribute and load data. [As described above, there is no 
reference in the specification for the elements 1 1 10, 1 130, 1 140, and 1 150: the load 
address match, load data match, and load flush, and load data. Therefore, these fields 
will be given a reasonable common English meaning. Also, the load attribute field is not 
defined explicitly and the same rule will be applied to it. As shown in figure 5, the 
extended load buffer holds a load address. This address is updated as a result of a 
load instruction or an instruction that was matched as a load. Therefore, this field is 
also the load address match field. Figure 5 also shows that the table includes a PC 
field, which gives the age of the instruction. This is load data of a load instruction, which 
is also a load attribute. Since the data is written there upon realizing that an instruction 
matches a load instruction, the field is also a load data match. Column 9, lines 10-21 
show that a load can be marked upon a match indicating that the load must be re- 
executed. Since the extended load buffer holds information for the loads, it is clear that 
this buffer would then hold the marking bits in such an embodiment as the prediction 
bits. Since the marking bits set above are marked not only on a match of addresses but 
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also on improper ordering, these bits also signify a load flush because the load and 
subsequent instructions need to be flushed for re-execution as shown previously.] 

13. In regard to claim 8, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 1 , as described above, wherein the CHT is one of 
indexed by a tag and tagless. [Yoaz has shown in figure 2A a tagged CHT.] 

14. In regard to claim 9, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 1 , as described above, wherein the CHT includes 
distance bits. [Yoaz has shown in figure 2D a CHT including distance bits.] 

15. In regard to claim 10, 

a. Kahle discloses an apparatus comprising: 

i. a processor (figure 1 ) having internal memory (figure 1 , element 1 ); 

ii. a bus coupled to the processor (figure 1 , element 2); 

iii. a memory coupled to a memory controller and the processor; 
[Column 2, line 65 - column 3, line 1 shows a memory used by and thus 
coupled to the processor. It is inherent that the memory has control logic 
so that it can be manipulated.] 

iv. an extended load buffer; [Figure 5 gives a layout of a load queue 
which is also a buffer.] 

v. a marking process; [Figure 6, shows in step 4 that a load program 
number is placed or marked into the load queue (extended load buffer). 
Therefore a marking unit must exist to perform this marking.] 
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vi. a comparing process; [Figure 6, step 6 and column 2, lines 56-59 
show that a store address is compared to the load addresses of the load 
buffer. For this comparison to take place a comparison unit must exist.] 

vii. and a recovery process; [Figure 6, steps 9 and 10 and column 3, 
lines 4-1 1 show that a load instruction must be placed in original order and 
reexecuted. It is shown that this is because of a conflict that exists and 
therefore original order must be recovered. This must be accomplished 
with a recovery unit.] 

viii. wherein unexecuted load instructions are advanced over silent 
store instructions. [Column 2, lines 56-59 show that when a store 
executes, it's address is compared to previously executed load 
instructions, in a load queue, which executed out of order ahead of the 
store. So in the timeframe of the store execution the loads have been 
previously executed. However, in order to execute the load instructions 
out of order ahead of the store, the unexecuted load instructions are 
inherently advanced over the store for execution and once executed 
become previously executed loads that were previously advanced over 
the store. This is further show in column 3, lines 57-66. An example is in 
Figure 2 with column 5, line 46 - column 6, line 13 where a load is 
executed ahead of a store instruction and thus the load is advanced 
ahead of a store instruction for execution (and thus was unexecuted when 
advanced). When the data being stored by the store instruction is the 
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same as the data already in the register, the store instruction is a silent 
store and was likewise advanced over by the load. Since load instructions 
can be advanced past any store, they will also be advanced past silent 
stores, which do not affect system state as defined in the specification.] 

b. Kahle does not disclose a predictor having a collision history table (CHT), 
said predictor for predicting silent store instructions, or that the extended load 
buffer is coupled to the predictor. Kahle also does not disclose wherein the 
predictor compares an unexecuted load instruction with an issued and 
unexecuted store instruction value, and the unexecuted load instruction 
bypasses the issued store instruction for execution if the unexecuted load 
instruction value and the issued and unexecuted store value are the same. 

c. Yoaz has disclosed a predictor having a collision history table (CHT) 
(figure 3, element 88). Column 3, lines 50-52 show that the CHT is used for 
predicting and thus is part of a predictor along with the control unit (figure 3, 
element 102). Figure 3 shows that the CHT or predictor is coupled to a recorder 
buffer, 94, using some control logic. Column 6, lines 35-36, shows that this 
buffer holds entries for load instructions. Yoaz has disclosed in column 3, lines 
50-60 that the predictor is used for predicting load instructions so that loads can 
be executed ahead of stores. Lipasti discusses the notion of a silent store on 
page 183, column 2, last paragraph. It has the same definition as given by the 
applicant. Lipasti mentions on page 183, column 2, third paragraph that stride 
prediction is used. One will notice that in this same paragraph Lipasti speaks of 
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silent stores and how a tagged last value predictor's limit is reached by these 
silent store and that the stride predictor is used by their design because it is not 
limited by silent stores and thus can more accurately predict stores including 
silent stores. Further proof of the use of silent store prediction is shown in 
section 3.4. It is shown that a "perfect method" is used to squash silent stores 
once they are predicted. The Examiner is taking Official notice that it is well 
known to one of ordinary skill in the art to compare an unexecuted load 
instruction with an unexecuted store instruction value, and have the unexecuted 
load instruction bypass the issued store instruction for execution if the 
unexecuted load instruction value and the issued and unexecuted store value are 
the same. 

d. Yoaz has shown in column 2, lines 58-63 that his method is able to 
execute more load instructions out of order (based on the predictor) for faster 
processor operations. These faster processor operations would have motivated 
one of ordinary skill in the art to modify the design of Kahle to use the collision 
history table and predictor described by Yoaz. Page 185, section 3.1 of Lipasti 
then shows that squashing these silent stores (using prediction as shown in 
section 3.4 under the perfect method) allows a designer to obtain greater 
performance from existing structures, or a reduction in size or complexity of the 
system. This ability to obtain greater performance or reduction in size would 
have motivated one of ordinary skill in the art to modify the design of Kahle to 
include the silent store prediction given by Lipasti. With these modifications in 
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place the design now predicts what loads can be advanced ahead of stores, 
effectively predicting silent stores as shown above. One of ordinary skill in the 
art would have recognized the advantages of comparing an unexecuted load 
instruction with an unexecuted store instruction value, and have the unexecuted 
load instruction bypass the issued store instruction for execution if the 
unexecuted load instruction value and the issued and unexecuted store value are 
the same so that a memory access is avoided and performance is sped up by 
advancing a load over a store (and squashing the store) while sending data 
directly from the register in the store instruction to the register destination of the 
load instruction. 

It would have been obvious to one of ordinary skill in the art at the time of invention to 
modify the design of Kahle to include a predictor having a collision history table as 
disclosed by Yoaz that predicts silent stores as taught by Lipasti so that processor 
operations may be sped up and greater performance or smaller area may be realized. 
It would have been obvious to one of ordinary skill in the art to compare an unexecuted 
load instruction with an unexecuted store instruction value, and have the unexecuted 
load instruction bypass the issued store instruction for execution if the unexecuted load 
instruction value and the issued and unexecuted store value are the same so that a 
memory access is avoided and performance is sped up.ln regard to claim 11, Kahle in 
view of Yoaz and further in view of Lipasti has disclosed the apparatus of claim 10, 
wherein the predictor is a silent store predictor, as described above. 
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1 6. In regard to claim 1 2, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 1 1 , as described above, wherein the silent store 
predictor uses path based indexing and the path is based on branches. [Column 5, 
lines 9-23 of Yoaz shows how the CHT (the predictor) is used. This section shows that 
the sequence of instructions is based on the correct prediction of branches. As shown 
in column 4, lines 8-10, the tag of the CHT is the linear instruction pointer. Thus, the 
predictor is indexed based on the linear sequence of instructions that is used based on 
branches.] 

1 7. In regard to claim 1 3, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 12, wherein the silent store predictor is coupled with a 
state machine. [Column 4, lines 43-45 of Yoaz show that the CHT includes prediction 
bits being either sticky or saturating counters. A saturating counter in itself is a state 
machine because it varies its state or value based on inputs.] 

1 8. In regard to claim 14, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 13, wherein the state machine is one of a 1-bit, 2-bit, 
and a sticky bit. [Column 4, lines 43-45 of Yoaz show that the CHT includes prediction 
bits being either sticky or saturating counters.] 

1 9. In regard to claim 1 5, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 10, as described above, wherein the predictor is 
memory dependent. [Column 3, lines 54-60 of Yoaz show that predictor is based on 
memory addresses and thus is memory dependant.] 
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20. In regard to claim 1 6, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 10, wherein the extended load buffer comprises bit 
fields to mark load address match, load data match, load predict, and load flush, and bit 
fields for load address, load attribute and load data. [As described above, there is no 
reference in the specification for the elements 1110, 1130, 1140, and 1150: the load 
address match, load data match, and load flush, and load data. Therefore, these fields 
will be given a reasonable common English meaning. Also, the load attribute field is not 
defined explicitly and the same rule will be applied to it. As shown in figure 5, the 
extended load buffer holds a load address. This address is updated as a result of a 
load instruction or an instruction that was matched as a load. Therefore, this field is 
also the load address match field. Figure 5 also shows that the table includes a PC 
field, which gives the age of the instruction. This is load data of a load instruction, which 
is also a load attribute. Since the data is written there upon realizing that an instruction 
matches a load instruction, the field is also a load data match. Column 9, lines 10-21 
show that a load can be marked upon a match indicating that the load must be re- 
executed. Since the extended load buffer holds information for the loads, it is clear that 
this buffer would then hold the marking bits in such an embodiment as the prediction 
bits. Since the marking bits set above are marked not only on a match of addresses but 
also on improper ordering, these bits also signify a load flush because the load and 
subsequent instructions need to be flushed for re-execution as shown previously.] 
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21 . In regard to claim 17, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 10, as described above, wherein the CHT is one of 
indexed by a tag and tagless. [Yoaz has shown in figure 2A a tagged CHT.] 

22. In regard to claim 18, Kahle in view of Yoaz and further in view of Lipasti has 
disclosed the apparatus of claim 10, as described above, wherein the CHT includes 
distance bits. [Yoaz has shown in figure 2D a CHT including distance bits.] 

23. Claims 19-26 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Kahle in view of Lipasti (On the Value Locality of Store Instructions). 

24. In regard to Claim 19, 

a. Kahle discloses a method comprising: 

i. fetching an instruction (figure 6, step 1 ) and determining if an 
instruction is one of a store and a load (figure 6, step 3); 

ii. issuing the store instruction; [Figure 6, step 5 shows that stores are 
executed and thus must by issued in the pipeline.] 

iii. comparing an address and data of the store address with load 
instructions in an extended load buffer (figure 6, steps 6 and 8); [Figure 5 
gives a layout of the load queue used in figure 6 to hold load information. 
This queue is also a buffer. The PC value indicates a program number for 
comparison of age of the instructions and thus is data of the instructions.] 

iv. setting marking bits in the extended load buffer if a match is found 
in the comparing; [Column 9, lines 10-21 show that a load can be marked 
upon a match indicating that the load must be re-executed. Since the 
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extended load buffer holds information for the loads, it is clear that this 
buffer would then hold the marking bits in such an embodiment.] 

v. updating a memory with store instruction if the store instruction can 
be retired; [Column 6, lines 37-39 show that the memory is updated when 
the result of a store is committed or retired.] 

vi. and bypassing a store instruction and executing the load instruction 
ahead of the silent store instruction. [Column 2, lines 56-59 show that 
when a store executes, it's address is compared to previously executed 
load instructions, in a load queue, which executed out of order ahead of 
the store. So in the timeframe of the store execution the loads have been 
previously executed. However, in order to execute the load instructions 
out of order ahead of the store, the unexecuted load instructions are 
inherently advanced over the store for execution and once executed 
become previously executed loads that were previously advanced over 
the store. This is further show in column 3, lines 57-66. An example is in 
Figure 2 with column 5, line 46 - column 6, line 13 where a load is 
executed ahead of a store instruction and thus the load is advanced 
ahead of a store instruction for execution (and thus was unexecuted when 
advanced). When the data being stored by the store instruction is the 
same as the data already in the register, the store instruction is a silent 
store and was likewise advanced over by the load. Since load instructions 
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can be advanced past any store, they will also be advanced past silent 
stores, which do not affect system state as defined in the specification.] 

b. Kahle does not disclose performing a silent store prediction if the 
instruction is a store nor does Kahle disclose bypassing a predicted silent store if 
an unexecuted load instruction value matches the issued and unexecuted store 
instruction value; 

c. Lipasti discusses the notion of a silent store on page 1 83, column 2, last 
paragraph. It has the same definition as given by the applicant. Lipasti mentions 
on page 183, column 2, third paragraph that stride prediction is used. One will 
notice that in this same paragraph Lipasti speaks of silent stores and how a 
tagged last value predictor's limit is reached by these silent store and that the 
stride predictor is used by their design because it is not limited by silent stores 
and thus can more accurately predict stores including silent stores. Further proof 
of the use of silent store prediction is shown in section 3.4. It is shown that a 
"perfect method" is used to squash silent stores once they are predicted. The 
Examiner is taking Official notice that it is well known to one of ordinary skill in 
the art to compare an unexecuted load instruction with an unexecuted store 
instruction value, and have the unexecuted load instruction bypass the issued 
store instruction for execution if the unexecuted load instruction value and the 
issued and unexecuted store value are the same. 

d. Page 185, section 3.1 of Lipasti then shows that squashing these silent 
stores (using prediction as shown in section 3.4 under the perfect method) allows 
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a designer to obtain greater performance from existing structures, or a reduction 
in size or complexity of the system. This ability to obtain greater performance or 
reduction in size would have motivated one of ordinary skill in the art to modify 
the design of Kahle to include the silent store prediction given by Lipasti. With 
these modifications in place the design now predicts what loads can be 
advanced ahead of stores, effectively predicting silent stores as shown above. 
One of ordinary skill in the art would have recognized the advantages of 
comparing an unexecuted load instruction with an unexecuted store instruction 
value, and have the unexecuted load instruction bypass the issued store 
instruction for execution if the unexecuted load instruction value and the issued 
and unexecuted store value are the same so that a memory access is avoided 
and performance is sped up by advancing a load over a store (and squashing the 
store) while sending data directly from the register in the store instruction to the 
register destination of the load instruction. 
It would have been obvious to one of ordinary skill in the art at the time of invention to 
modify the design of Kahle to predict silent stores as taught by Lipasti so that processor 
operations may be sped up and greater performance or smaller area may be realized. 
It would have been obvious to one of ordinary skill in the art to compare an unexecuted 
load instruction with an unexecuted store instruction value, and have the unexecuted 
load instruction bypass the issued store instruction for execution if the unexecuted load 
instruction value and the issued and unexecuted store value are the same so that a 
memory access is avoided and performance is sped up. 
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25. In regard to claim 20, Kahle in view of Lipasti has disclosed the method of claim 
19, as described above, further comprising preparing the load instruction for retirement, 
if the load instruction is complete, and determining if the load instruction is marked flush 
in the extended load buffer. [Column 6, lines 20-24 of Kahle show that a load is 
committed or retired. In preparation for this, the address and program number are 
removed from the load queue. Since the marking bits set above are marked not only on 
a match of addresses but also on improper ordering, these bits also signify a load flush 
because the load and subsequent instructions need to be flushed for re-execution as 
shown previously.] 

26. In regard to claim 22, Kahle in view of Lipasti discloses the method of claim 19, 
wherein the memory is a cache. [As shown in column 6, lines 37-39 of Kahle, the 
completed store operation writes to a memory via a cache, thus the operation writes to 
the cache memory as well as a memory.] 

27. In regard to claim 23, 

a. Kahle discloses a program storage device readable by a machine 
comprising instructions that cause the machine to: 

i. fetch an operation (figure 6, step 1 ) and determining if an 
instruction is one of a store instruction and a load instruction (figure 6, step 

3); 

ii. execute the store instruction (figure 6, step 5); 

iii. compare an address and data of the store operation with load 
operations in an extended load buffer (figure 6, steps 6 and 8); [Figure 5 
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gives a layout of the load queue used in figure 6 to hold load information. 
This queue is also a buffer. The PC value indicates a program number for 
comparison of age of the instructions and thus is data of the instructions.] 

iv. setting marking bits in the extended load buffer if a match is found 
in the compare instruction; [Column 9, lines 10-21 show that a load can be 
marked upon a match indicating that the load must be re-executed. Since 
the extended load buffer holds information for the loads, it is clear that this 
buffer would then hold the marking bits in such an embodiment.] 

v. update a memory with store operation if the store operation can be 
retired; [Column 6, lines 37-39 show that the memory is updated when the 
result of a store is committed or retired.] 

vi. and bypass a silent store operation and execute a load operation 
ahead of the silent store operation if the operation is a load. [Column 2, 
lines 56-59 show that when a store executes, it's address is compared to 
previously executed load instructions, in a load queue, which executed out 
of order ahead of the store. So in the timeframe of the store execution the 
loads have been previously executed. However, in order to execute the 
load instructions out of order ahead of the store, the unexecuted load 
instructions are inherently advanced over the store for execution and once 
executed become previously executed loads that were previously 
advanced over the store. This is further show in column 3, lines 57-66. 
An example is in Figure 2 with column 5, line 46 - column 6, line 13 where 
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a load is executed ahead of a store instruction and thus the load is 
advanced ahead of a store instruction for execution (and thus was 
unexecuted when advanced). When the data being stored by the store 
instruction is the same as the data already in the register, the store 
instruction is a silent store and was likewise advanced over by the load. 
Since load instructions can be advanced past any store, they will also be 
advanced past silent stores, which do not affect system state as defined in 
the specification.] 

b. Kahle does not disclose performing a silent store prediction if the 
operation is a store instruction nor does Kahle disclose bypassing a predicted 
silent store if an unexecuted load instruction value matches the issued and 
unexecuted store instruction value; 

c. Lipasti discusses the notion of a silent store on page 183, column 2, last 
paragraph. It has the same definition as given by the applicant. Lipasti mentions 
on page 183, column 2, third paragraph that stride prediction is used. One will 
notice that in this same paragraph Lipasti speaks of silent stores and how a 
tagged last value predictor's limit is reached by these silent store and that the 
stride predictor is used by their design because it is not limited by silent stores 
and thus can more accurately predict stores including silent stores. Further proof 
of the use of silent store prediction is shown in section 3.4. It is shown that a 
"perfect method" is used to squash silent stores once they are predicted. The 
Examiner is taking Official notice that it is well known to one of ordinary skill in 
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the art to compare an unexecuted load instruction with an unexecuted store 
instruction value, and have the unexecuted load instruction bypass the issued 
store instruction for execution if the unexecuted load instruction value and the 
issued and unexecuted store value are the same. 

d. Page 185, section 3.1 of Lipasti then shows that squashing these silent 
stores (using prediction as shown in section 3.4 under the perfect method) allows 
a designer to obtain greater performance from existing structures, or a reduction 
in size or complexity of the system. This ability to obtain greater performance or 
reduction in size would have motivated one of ordinary skill in the art to modify 
the design of Kahle to include the silent store prediction given by Lipasti. With 
these modifications in place the design now predicts what loads can be 
advanced ahead of stores, effectively predicting silent stores as shown above. 
One of ordinary skill in the art would have recognized the advantages of 
comparing an unexecuted load instruction with an unexecuted store instruction 
value, and have the unexecuted load instruction bypass the issued store 
instruction for execution if the unexecuted load instruction value and the issued 
and unexecuted store value are the same so that a memory access is avoided 
and performance is sped up by advancing a load over a store (and squashing the 
store) while sending data directly from the register in the store instruction to the 
register destination of the load instruction. 
It would have been obvious to one of ordinary skill in the art at the time of invention to 
modify the design of Kahle to predict silent stores as taught by Lipasti so that processor 
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operations may be sped up and greater performance or smaller area may be realized. 
It would have been obvious to one of ordinary skill in the art to compare an unexecuted 
load instruction with an unexecuted store instruction value, and have the unexecuted 
load instruction bypass the issued store instruction for execution if the unexecuted load 
instruction value and the issued and unexecuted store value are the same so that a 
memory access is avoided and performance is sped up. 

28. In regard to claim 24, Kahle in view of Lipasti has disclosed the method of claim 
23, as described above, wherein the instructions further cause the machine to prepare 
the load operation for retirement if the load operation is complete, and determining if the 
load operation is marked flush in the extended load buffer. [Column 6, lines 20-24 of 
Kahle show that a load is committed or retired. In preparation for this, the address and 
program number are removed from the load queue. Since the marking bits set above 
are marked not only on a match of addresses but also on improper ordering, these bits 
also signify a load flush because the load and subsequent instructions need to be 
flushed for re-execution as shown previously.] 

29. In regard to claim 26, Kahle in view of Lipasti discloses the program storage 
device of claim 23, wherein the memory is a cache. [As shown in column 6, lines 37-39 
of Kahle, the completed store operation writes to a memory via a cache, thus the 
operation writes to the cache memory as well as a memory.] 

30. Claims 21 and 25 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Kahle in view of Lipasti as applied to claims 19-20, 22-24, and 26 above, and 
further in view of Yoaz. 
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31. In regard to claim 21, 

a. Kahle in view of Lipasti has disclosed the method of claim 19, as shown 
above, 

b. Kahle in view of Lipasti does not disclose wherein the predicting includes 
marking bits in a collision history table (CHT). 

c. Yoaz has disclosed a wherein the predicting includes marking bits in a 
collision history table (CHT) (figure 3, element 88). Column 3, lines 50-52 show 
that the CHT is used for predicting. Column 5, lines 57-67, show updating or 
marking the CHT. 

d. Yoaz has shown in column 2, lines 58-63 that his method is able to 
execute more load instructions out of order (based on the predictor) for faster 
processor operations. These faster processor operations would have motivated 
one of ordinary skill in the art to modify the design of Kahle in view of Lipasti to 
use the collision history table and predictor described by Yoaz. 

It would have been obvious to one of ordinary skill in the art at the time of invention to 
modify the design of Kahle in view of Lipasti to include the collision history table 
predictor disclosed by Yoaz so that processor operations may be sped up. 

32. In regard to claim 25, 

- a. Kahle in view of Lipasti has disclosed the method of claim 23, as shown 
above, 
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b. Kahle in view of Lipasti does not disclose wherein the instruction that 
causes the machine to predict silent stores includes an instruction that causes 
the machine to mark bits in a collision history table (CHT). 

c. Yoaz has disclosed a wherein the instruction that causes the machine to 
predict silent stores includes an instruction that causes the machine to mark bits 
in a collision history table (CHT) (figure 3, element 88). Column 3, lines 50-52 
show that the CHT is used for predicting. Column 5, lines 57-67, show updating 
or marking the CHT. 

d. Yoaz has shown in column 2, lines 58-63 that his method is able to 
execute more load instructions out of order (based on the predictor) for faster 
processor operations. These faster processor operations would have motivated 
one of ordinary skill in the art to modify the design of Kahle in view of Lipasti to 
use the collision history table and predictor described by Yoaz. 

It would have been obvious to one of ordinary skill in the art at the time of invention to 
modify the design of Kahle in view of Lipasti to include the collision history table 
predictor disclosed by Yoaz so that processor operations may be sped up. 

Response to Arguments 

33. Applicant's arguments have been fully considered but they are not persuasive. 

34. Applicant has argued that the prior art of record does not disclose the limitation of 
claims 1 , 10, 19, and 23 requiring that unexecuted load instructions are advanced over 
silent store instructions. Column 2, lines 56-59 show that when a store executes, it's 
address is compared to previously executed load instructions, in a load queue, which 



Application/Control Number: 09/752,796 Page 27 

Art Unit: 2183 

executed out of order ahead of the store. So in the timeframe of the store execution the 
loads have been previously executed. However, in order to execute the load 
instructions out of order ahead of the store, the unexecuted load instructions are 
inherently advanced over the store for execution and once executed become previously 
executed loads that were previously advanced over the store. This is further show in 
column 3, lines 57-66. An example is in Figure 2 with column 5, line 46 - column 6, line 
13 where a load is executed ahead of a store instruction and thus the load is advanced 
ahead of a store instruction for execution (and thus was unexecuted when advanced). 
When the data being stored by the store instruction is the same as the data already in 
the register, the store instruction is a silent store and was likewise advanced over by the 
load. Since load instructions can be advanced past any store, they will also be 
advanced past silent stores, which do not affect system state as defined in the 
specification. 

35. Applicant has argued on pages that the prior art of record does not teach silent 
store prediction as disclosed in claims 1, 10, 19, and 23. Lipasti discusses the notion of 
a silent store on page 183, column 2, last paragraph. It has the same definition as 
given by the applicant. Lipasti mentions on page 183, column 2, third paragraph that 
stride prediction is used. One will notice that in this same paragraph Lipasti speaks of 
silent stores and how a tagged last value predictor's limit is reached by these silent 
store and that the stride predictor is used by their design because it is not limited by 
silent stores and thus can more accurately predict stores including silent stores. Further 
proof of the use of silent store prediction is shown in section 3.4. It is shown that a 
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"perfect method" is used to squash silent stores once they are predicted. Page 185, 
section 3.1 of Lipasti then shows that squashing these silent stores (using prediction as 
shown in section 3.4 under the perfect method) allows a designer to obtain greater 
performance from existing structures, or a reduction in size or complexity of the system. 
This ability to obtain greater performance or reduction in size would have motivated one 
of ordinary skill in the art to modify the design of Kahle to include the silent store 
prediction given by Lipasti. With these modifications in place the design now predicts 
what loads can be advanced ahead of stores, effectively predicting silent stores as 
shown above. Further, the 35 USC 103 modifications are proper since the prediction of 
silent stores is disclosed with proper motivation as given above. 

36. The Applicant also argues that The Examiner does not disclose the new 
limitations of the independent claims, which are addressed above in the rejections. 

Conclusion 

37. The following is text cited from 37 CFR 1.111 (c): In amending in reply to a 
rejection of claims in an application or patent under reexamination, the applicant or 
patent owner must clearly point out the patentable novelty which he or she thinks the 
claims present in view of the state of the art disclosed by the references cited or the 
objections made. The applicant or patent owner must also show how the amendments 
avoid such references or objections. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Shane F Gerstl whose telephone number is (571) 272- 
4166. The examiner can normally be reached on M-F 6:45-4:15 (First Friday Off). 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Eddie Chan can be reached on (571 ) 272-4162. The fax phone number for 
the organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
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